Towards a Part-of-Speech Ontology: Encoding Morphemic Units of Two South African Bantu Languages
نویسنده
چکیده
This article describes the design of an electronic knowledge base, namely a morpho-syntactic database structured as an ontology of linguistic categories, containing linguistic units of two related languages of the South African Bantu group: Northern Sotho and Zulu. These languages differ significantly in their surface orthographies, but are very similar on the lexical and sub-lexical levels. It is therefore our goal to describe the morphemes of these languages in a single common database in order to outline and interpret commonalities and differences in more detail. Moreover, the relational database which is developed defines the underlying morphemic units (morphs) for both languages. It will be shown that the electronic part-ofspeech ontology goes hand in hand with part-of-speech tagsets that label morphemic units. This database is designed as part of a forthcoming system providing lexicographic and linguistic knowledge on the official South African Bantu languages.
منابع مشابه
Idiosyncratic sound systems of the South African Bantu languages: Research and clinical implications for speech-language pathologists and audiologists.
The objective of this article is to create awareness amongst speech-language pathologists and audiologists in South Africa regarding the difference between the sound systems of Germanic languages and the sound systems of South African Bantu languages. A brief overview of the sound systems of two Bantu languages, namely isiZulu and Setswana, is provided. These two languages are representative of...
متن کاملLexical Semantics and Selection of TAM in Bantu Languages: A Case of Semantic Classification of Kiswahili Verbs
The existing literature on Bantu verbal semantics demonstrated that inherent semantic content of verbs pairs directly with the selection of tense, aspect and modality formatives in Bantu languages like Chasu, Lucazi, Lusamia, and Shiyeyi. Thus, the gist of this paper is the articulation of semantic classification of verbs in Kiswahili based on the selection of TAM types. This is because the sem...
متن کاملIdiosyncratic sound systems of the South African Bantu languages: Research and clinical implications for speech-language pathologists and audiologists
Read online: Scan this QR code with your smart phone or mobile device to read online. The objective of this article is to create awareness amongst speech-language pathologists and audiologists in South Africa regarding the difference between the sound systems of Germanic languages and the sound systems of South African Bantu languages. A brief overview of the sound systems of two Bantu language...
متن کاملPhonetics of intonation in South African Bantu languages
Much is already known about the prosodic systems of the indigenous South African languages from descriptions and analyses in the existing literature. All of the existing work has been carried out in the field of African studies or formal linguistics. In order to be able to implement the generalisations obtained into computational models in speech processing, the existing sources and results mus...
متن کاملResource-Light Bantu Part-of-Speech Tagging
Recent scientific publications on data-driven part-of-speech tagging of Sub-Saharan African languages have reported encouraging accuracy scores, using off-the-shelf tools and often fairly limited amounts of training data. Unfortunately, no research efforts exist that explore which type of linguistic features contribute to accurate part-of-speech tagging for the languages under investigation. Th...
متن کامل